Manticore: A 4096-Core RISC-V Chiplet Architecture for Ultraefficient Floating-Point Computing
نویسندگان
چکیده
Data-parallel problems demand ever growing floating-point (FP) operations per second under tight area- and energy-efficiency constraints. In this work, we present Manticore, a general-purpose, ultraefficient chiplet-based architecture for data-parallel FP workloads. We have manufactured prototype of the chiplet’s computational core in Globalfoundries 22FDX process demonstrate more than 5x improvement energy efficiency on intensive workloads compared to CPUs GPUs. The compute capability at high area is provided “Snitch: A tiny pseudo dual-issue processor efficient execution workloads,” IEEE Trans. Comput., containing eight small integer cores, each controlling large unit (FPU). supports two custom ISA extensions: SSRs extension elides explicit load store instructions by encoding them as register reads writes (“Stream semantic registers: lightweight RISC-V achieving full utilization single-issue cores,” Comput.). repetition decouples from FPU allowing be issued independently. These extensions allow minimize its instruction fetch bandwidth saturate FPU, above 90%, with 40% dedicated FPU.
منابع مشابه
A 32-bit FPGA-based Single Precision Floating-point Hybrid CORDIC Processor Based on RISC Architecture
This paper presents the design process of a 32-bit single precision floating-point Hybrid Coordinate Rotation Digital Computer (CORDIC) processor on Field Programmable Gate Array (FPGA) which used to perform the mathematical computation operations for various elementary functions such as trigonometry and hyperbolic functions, exponential, natural logarithm, square root as well as multiplication...
متن کاملHardware accelerated approach for floating-point multiplication on 32-bit pipelined RISC-V processor
Implementing hardware support for all extensions of the RISC-V Instruction Set Architecture inside a processor would lead to avoidable area and power consumption for applications that rarely utilize a particular extension. In this paper, authors have first suggested a modified 3-stage pipeline alternative to the ZSCALE processor (32-bit) by UC Berkeley. Subsequently a hardware-accelerated appro...
متن کاملA RISC-V Extension for the Fresh Breeze Architecture
We report on a RISC-V extension for a novel multi-core computer organization able to execute applications with high performance and energy efficiency. Novel features of this architecture include support for data objects represented by trees of 128-byte memory chunks, and hardware implementation of task scheduling and load balancing. We call our project Fresh Breeze1 in view of its novelty and p...
متن کاملA Transprecision Floating-Point Platform for Ultra-Low Power Computing
In modern low-power embedded platforms, the execution of floating-point (FP) operations emerges as a major contributor to the energy consumption of compute-intensive applications with large dynamic range. Experimental evidence shows that 50% of the energy consumed by a core and its data memory is related to FP computations. The adoption of FP formats requiring a lower number of bits is an inter...
متن کاملLabeled RISC-V: A New Perspective on Software-Defined Architecture
Traditional computer architectures are insufficient to convey important high-level requirements of applications to the hardware. These requirements include QoS and security, which are extremely important to data centers in the cloud era. To guarantee better QoS in data centers, we propose a new computer architecture LvNA (Labeled von Neumann Architecture) that leverages labeling mechanism and p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Micro
سال: 2021
ISSN: ['1937-4143', '0272-1732']
DOI: https://doi.org/10.1109/mm.2020.3045564